Simple learning models can illuminate biased results from choice titration experiments

نویسندگان

  • Abran Steele-Feldman
  • James J. Anderson
چکیده

The choice titration procedure presents a subject with a repeated choice between a standard option that always provides the same reward and an adjusting option for which the reward schedule is adjusted based on the subject’s previous choices. The procedure is designed to determine the point of indifference between the two schedules which is then used to estimate a utility equivalence point between the two options. Analyzing the titration procedure as a Markov birth death process, we show that a large class of reinforcement learning models invariably generates a titration bias, and that the bias varies non-linearly with the reward value. We treat several titration procedures, presenting analytic results for some simple learning models and simulation results for more complex models. These results suggest that results from titration experiments are likely to be biased and that inferences based on the titration experiments may need to be reconsidered.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Rational Choice Theory: A Cultural Reconsideration

Economists have heralded the formulation of the expected utility theorem as a universal method of choice under uncertainty. In their seminal paper, Stigler and Becker (Stigler & Becker, 1977) declared that “human behavior can be explained by a generalized calculus of utility-maximizing behavior” (p.76). The universality of the rational choice theory has been widely criticized by psychologists, ...

متن کامل

Temporally-Biased Sampling for Online Model Management

To maintain the accuracy of supervised learning models in the presence of evolving data streams, we provide temporally-biased sampling schemes that weight recent data most heavily, with inclusion probabilities for a given data item decaying exponentially over time. We then periodically retrain the models on the current sample. This approach speeds up the training process relative to training on...

متن کامل

Learning in Economics Experiments

Reinforcement learning, belief learning, experiments, probability matching, market price-choice games, computer simulations. This paper explains how simple psychological models of reinforcement and belief learning can be used to explain dynamic patterns of adjustment in economics experiments.

متن کامل

Using the XCS Classifier System for Multi-objective Reinforcement Learning Problems

We investigate the performance of a learning classifier system in some simple multi-objective, multi-step maze problems, using both random and biased action-selection policies for exploration. Results show that the choice of action-selection policy can significantly affect the performance of the system in such environments. Further, this effect is directly related to population size, and we rel...

متن کامل

Theoretical Models of Learning to Learn

A Machine can only learn if it is biased in some way. Typically the bias is supplied by hand, for example through the choice of an appropriate set of features. However, if the learning machine is embedded within an environment of related tasks, then it can learn its own bias by learning suuciently many tasks from the environment 4, 6]. In this paper two models of bias learning (or equivalently,...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2013